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Abstract 

We consider a perturbation of an "integrable" Hamiltonian and give an expres- 
sion for the canonical or unitary transformation which "simphfies" this perturbed 
system. The problem is to invert a functional defined on the Lie-algebra of observ- 
ables. We give a bound for the perturbation in order to solve this inversion. And 
apply this result to a particular case of the control theory, as a first example, and 
to the "quantum adiabatic transformation", as another example. 
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We start with a vector space A which we call the space of observables. 

We will apply this theory to 2 main examples: the classical mechanics, in which A is an 

(abelian) algebra of functions defined on the phase space, and the quantum mechanics, in 

which A is an algebra of operators on some Hilbert space. By taking a basis of projectors 

in A we will consider these operators as a collection of "matrix elements", i.e. also as 

functions. 

We assume that A is endowed with a Lie stucture, i.e. there exists a mapping (called the 
"bracket") from A into the space £j{A) of linear operators of A: 

{...}: A-^L{A) 

V^{V} (1.1) 

which satisfies: 



WV,WeA {V}W = -{W}V (1.2) 
and = {V}{W} - {W}{V} (1.3) 

so that the "bracket" mapping is linear in its argument. It is also antisymmetric, by ()1.2|) 
and satisfies the Jacobi identity ()1.3|) . We consider a fixed element H of A which we call 
a Hamiltonian. The "motion" or the "fiow" generated by H is the 1-parameter group: 

VteM e'^^^-.A^A (1.4) 
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The exponential of {H} is defined by the usual power series: 



oo 



{t.H} 



n 



(1.5) 



n=0 



and is an automorphism of the Lie structure, as is proven below in ()5.4|1 . 
Actually what is only needed is that {H} is a linear operator "affiliated" to A which 
means that ()1.4|1 is still valid: {H} is assumed to be the generator of a 1-parameter group 
of automorphisms, even if H is not an element of A. 

Our problem is to find a relation (for instance a "conjugation" ) between the flow generated 
by a perturbation of H, denoted H + V for some V & A, and the flow of H. Poincare 
called this problem "the main problem of the dynamics". Of course this relation would 
be mainly useful if we have some information on the flow of the unperturbed hamiltonian 



We will study 2 types of problems: 

Problem 1 - If we are permitted to modify the perturbed hamiltonian H + V by adding 
a "small" term f{V), we will try to find a relation between the flow of H + V + fiV) 
and the flow of H. Then the term f{V) will be called a "control" term. 

Of course we want to exclude the trivial solution fiV) = —V so we have added the 
supplementary condition on this term to be "small", for instance quadratic in V . So we 
stabilize the perturbed hamiltonian by a small control term, well adapted to the problem. 

Problem 2 - If we are not permitted to modify the perturbed hamiltonian (for instance 
if we investigate the motion of some planet), or we don't want to modify it, then we will 
try to find a good "change of coordinates" , i.e. an automorphism of A, which connects 
the 2 flows: perturbed and unperturbed. 

We will search this automorphism under the form of some exponential e^'"^^ where we 
define F as follows. 

Let us make an important assumption on H, which will be satisfied when H is "inte- 
grable" , as seen below: 

Hypothesis 1 - We assume that there exists a linear operator T : A ^ A such that: 



Hence F is a pseudo-inverse of {H}: Let us remind that it is impossible to find a strict 
inverse of {H} since it always has a non-trivial kernel (for instance {II}H = 0). 
Any element V of A such that {II}V = 0, is constant under the flow of H: 



H. 



{HY F = {H} 



(1.6) 



and then we build two other operators !N" & by: 



y^:={H}T a^:=l-N 



(1.7) 



(1.8) 
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So the vector space Ker {H} is called the set of "constants of motion" . It is a sub-Lie- 
algebra of A since (using ()1.3j) ): 

If {H}V={H}W = Then {H}{V}W = {V}{H}W + {{H}V}W = + (1.9) 

Let us also note that ()1.6p can be rewritten as: 

{H}31 = (1.10) 

which means that the range RgCR of the operator "Ji is included in Ker{iJ}. The 1 is 
the identity in the algebra L{A) of endomorphisms of A. The notation Jl designates 
the "resonant part" and the notation 'N the "non-resonant part". Let us remind that 
{H},Jl,y^,r are elements of L{A). 



Application 1 - Classical Mechanics: 

A is the algebra of Q°° real- valued functions of {p, q) in some domain of M?^ or functions 
of {A^O) G IK X T"^ for some domain K of M.^ (L is the number of degree of freedoms). 
It is more convenient to make a Fourier transformation in 9, and so A may be taken as 
the algebra of functions V of {A, A) G K x into C, such that V{A, - A) = V{A, A)*. 
The Lie structure is the Poisson bracket. We can compute the operators 3^ and F when 
H is "integrable" , i.e. when if is a function of the "actions variables" only, and does not 
depend on the "angles variables" : 

H{A,e) = h{A) (1.11) 
or after a Fourier transformation in 9: 

H{A,A) = h{A).6o{^) (1-12) 

with S the Kronecker symbol. Indeed, let us denote by uj{A) the derivative (gradient) 
of h with respect to A. Then the bracket {H} = uj{A).dg and so, after a Fourier 
transformation, it is given by: 

i{H}V){A, A) = iMA) ■ A).V{A, A) (1.13) 

on any element V of A. We have used a scalar product uj{A) • A . So that the operator 
3i is: 

(W)(A A) = ViA, A).xiu;{A) ■ A = 0) (1.14) 

where we introduce a characteristic function x(c<j(A)-A = 0) which is 1 when uj{A)-A = 
and is otherwise. We could have written it as x(A G uj{A)-^ fl Z-^). Similarly: 

(W)(A A) = ViA, A).x(^(A) • A ^ 0) (1.15) 

Then the action of the operator F is given by: 

(TV)(A,A):^^^^^j^l^^.V(A,A) (1,16) 

See section IHl for more details and section [7| where we introduce a norm. 
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Application 2 - Quantum Mechanics: 

A is the algebra of operators on some separable Hilbert space. More precisely we as- 
sume that A has a unity (denoted by 1), and we consider a maximal family of mutually 
orthogonal projectors Pa € A where A varies in some countable set IK. That means: 

T.Pa = 1 (1.17) 

VA^'gK Pa.Pa' = Pa.Sa,a' (1-18) 

with 6 the Kronecker symbol. and F are easily written when H can be "diagonalized" . 
We choose H as follows: 

H=Y.hiA).PA (1.19) 

for some function h : "K ^ M.. So hCK) is the spectrum of H and the projectors are the 
spectral projectors of H. We can choose h to be the identity function by taking "K as 
the spectrum of H. 

Remark 1 - Reciprocally, if A were endowed with an involution ("*"^ named "adjonc- 
tion") we could have started by giving a self-adjoint operator H with pure point spectrum, 
and then we could have taken the spectral projectors of H, as the family of projectors Pa- 

Then any element V of A can be written as: 

V= Va,a' where Va,a' := PaVPa' (1.20) 

A,A'£^ 

It is convenient to introduce the set: 

g{A) := K - A := {A' - A}A'e:K (1.21) 

and to define: 

VA G K VA G GiA) V{A, A) := Pa+aVPa (1.22) 
i.e. V{A, A) = Va+a,a- For instance: 

H{A,A) = hiA)M^)-PA (1.23) 

which is similar to ()1.12|) . 

The Lie structure is given by the commutator. Actually we multiply it by i/h for some 
constant h which has the correct dimensionality, so that t{H} has no dimension: 

yv,weA {v}{w) ■.= i{v.w -w.v)/h (1.24) 

Let us now turn to the action of the operators {H} and Jl: 

{{H}V){A,A) = i.[h{A + A)-h{A)).V{A,A)/h (1.25) 

Hence: 

i3lV)iA, A) = ViA, A). x{hiA + A) = /i(A)) (1.26) 
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and: 

(W)(A, A) = V{A, A). x{h{A + A) ^ h{A)) (1.27) 
Then the operator F is given by: 



h.xih{A + A) ^ h{A)] 
(rv^)(A A) = . ' ^ ■ A) (1-28) 



See section IHl for more details and section [7| where we introduce a norm. 

Remark: we have introduced an arbitrary constant h in p.24|) . But we will notice 
that the operator F always appears with a bracket around it. And the pair {F . . .} is 
independant of h , since this constant is in the numerator and the denominator. 

2 - The Main Theorems 

Theorem 1 - The control problem (problem 1) is solved by an explicit formula. Let us 
first define the functions F and f : A ^ A by: 

1 - 

FiV) := e-^^^>3^V+ ^^^^ W (2.1) 

fiV) := F{V)-V (2.2) 

Then we have: 

Vt G M e*{^+^+^(^» = e-^r^>.e*{^>.e*^^^>.e^r^> (2.3) 

The meaning of the second term in 1)2.11) is the following: 

l-e^^ _ {-rVT ^ ,,,, (2 4) 

We will prove this theorem in section El This will solve the control problem if we can 
check that f{V) is indeed smaller than V. This will be proved in section [7| To see 
this, it is sufficient to notice that in the expression ^1} of F{W), the terms e'^^^^ and 
(1 - e-^^^^)/{TW} are near 1 when W ^ 0. So: 

F{W) ^31W + y^W := W so that f{W) = 0{W^) (2.5) 

as is seen in ()7.1()j) . 

The formula ()2.3)1 connects the perturbed flow, modified by a control term, with the 
unperturbed flow. 

The new factor, the flow of 51V will turn out to commute with the flow of H: cf. ()5.9|) . 

The second problem (the "change of coordinates" ) is solved by an inversion formula. Let 
us rewrite ()2.3p with F{V) instead of \^ + fiV), and with W instead of V: 

Vt G M e*{^+^(^)> = e-^r^>.e*{^>.e*^^^>.e^^^> (2.6) 
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Theorem 2 - If we can find W such that F(W) = V, i.e. W = F^^iV), then: 

Vt G M e*^^+^> = e-^^^>.e*W.e*{^^>.e{r^> (2.7) 



Here again, the flow of "JIW commute with the flow of H. 

Hence we need to invert the function F. But F is near the identity function around 
0. More precisely we will flnd a ball around in A, for some norm, such that the 
difference between F and the identity function (what we called / in ()2.2|) ) is Lipschitz 
and contractant: cf. (j2.5|) . So F can be inverted, at least around 0. See section [7| 

So to summarize this introduction: the flrst problem is solved explicitly for any pertur- 
bation, but we need to assume some smallness on the size of the perturbation to ensure 
that the "control" term is smaller than the original perturbation. And the second prob- 
lem is solved by an inversion formula, which also needs some smallness on the size of the 
perturbation to ensure that F is invertible. To be more precise, we need some norm on 
the Lie-algebra A: this is done in section [7| 

The equation in the unknown W (i.e. the inversion W = F~^{V)) may be named the 
"Hamilton- Jacobi equation". Indeed it yields the automorphism e^^^\ 

Remark 2 If we replace the hypothesis 1 by a (slightly) stronger one, then the situation 
is clearer, and simpler: 

We assume that there exists a linear operator T : A A such that: 

{Hy r = {H}T{H} = {H} (2.8) 

{H} = T{H}T = r (2.9) 

And we define 4 operators: 

7{:={H}T Ji:=l-7{ (2.10) 

d^:=T{H} :^:=1-:N" (2.11) 

Under this stronger assumption, we easily show that: 

X2 = X Jl^ = Ji X2 = N Jl^ = 5i (2.12) 
Ker!^ = RgK = RgT C KerO^ = Rg?^ = Rg{i7} (2.13) 
KerN = RgD^ = KerT C Ker X = Rg :R = Kei {H} (2.14) 

so that we have a characterization of the sub-Lie-Algebra Ker{if}.- the "constants of the 
motion" are exactly Rg3^. 

Furthermore, the 2 above "set-inequalities" become "set- equalities" if and only if: 

{H}T = T{H} i.e. N = K (2.15) 

In that case i2.8l \2.9) are equivalent to M.(^) . For instance, the assumption is 
satisfied in examples M.l(^) or M.2<^) . 

But we don't need this stronger hypothesis for the rest of this paper. 
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- Localization of the Action Variable 



Let us study the case of the classical mechanics, as above, with action-angles variables. 
Usually, we localize the action variable A near some point Aq i.e. we make the canonical 
change of variables, from {A, 6) to {Ai,6i) with: 

A = Ao + eAi 6 = 01 (3.1) 

along with a rescaling of the hamiltonian: 

Hi{A) ■.= H{Ao + e.A)/e (3.2) 

where we choose some positive constant e. This transformation is called a "canonical 
similarity" since it preserves the symplectic form, up to the multiplicative constant e. 
When H is integrable in the usual sense, with actions-angles variables, we can expand the 
hamiltonian H = H{A) around Aq. 

H{Ao + eAi) =c + e.uj-Ai + q{e.Ai) where u := H'{Ao) and g(0) = g'(0) = (3.3) 

(i.e. q is quadratic in Ai) and where the additive constant c := H{Aq) is not relevant 
and will be forgotten. Hence: 

Hi{A) =uj- A + e.q{A) (3.4) 
where q{A) := q{e.A)/e'^ is of order e^. 

Let us introduce a perturbation V, as above, and choose e := for some norm. 

Under the rescaling (j3.2j) of the hamiltonian, the perturbation is also divided by e and so 
becomes: 

Vi{A,e) ■.= V{Ao + €.A,e)/6 (3.5) 
which is of order e, so that the perturbed hamiltonian is now: 

Hi{A) + Vi{A,e)=u;-A + e.ViiA, 6) (3.6) 

with V2{A, 9) = q{A) + Vi{A, 9)/e (3.7) 

which is of order e^. So to summarize, it is always possible to assume that the integrable 
part is an harmonic oscillator, at least locally in the variable A, i.e. in a region (around 
any fixed Aq) where A (in the hamiltonian (j3.6p ) is of order e^, but it may be wrong when 
A is of order 1/e. Hence in this case of classical mechanics, localized in action variable, 
the operator F is: 

r = ■ with d := de (3.8) 
UJ ■ o 

Let us explicit the action of F on an arbitrary trigonometric observable: 

ABA 

re'^^ = • X(A G uo^) (3.9) 

So that the operator IN" given by p.l5|) becomes here, after a Fourier transformation from 
the angles 9 to the integer vector A: 

(W)(A, A) = r(A, A).x(A G \cu^) (3.10) 
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4 - Non-Resonant Hamiltonians 

We can also define the notion of "non-resonance" as follows. 



Definition 1 - In classical mechanics, H is "non-resonant" iff: 

V 1/, G Ker {H} we have {V}W = (4.1) 
In quantum mechanics, H is "non-resonant" iff: 

VV,P G Ker {if} s.t. = P we have {V}P = (4.2) 

Hence in the classical case: 

yW,V eA {31W}:RV = i.e. {XA}31 = (4.3) 

The above definition means that any 2 constants of motion of H do commute together. 
After the localisation of the action variable, as in the preceding section IHl the above non- 
resonant condition ()4.1|) is equivalent to the "usual" non-resonance condition, which is in 
classical mechanics: 

uj^nZ^ = {0} (4.4) 

i.e. there are no integer vector A orthogonal to uj, except for A = 0. And the operator "Ji 
may be written as the multiplication by the characteristic function x(A = 0). 
We will give an example of a resonant hamiltonian in ()9.1|) . for which we can still apply 
our control theory. 

In quantum mechanics, the non-resonance condition ()4.2|) means that any spectral projec- 
tor of H commutes with any constant of motion. The non-resonance condition is "usually" 
defined for the Floquet case: 

H= {k + h{A)).Pu,A (4.5) 

for some projectors Pfc,A which are mutually orthogonal. For instance the function h{A) 
may be taken as uj.A or ui.A^. The condition ()4.2|) is satisfied exactly when the set of 
the spectral gaps intersects Z in the only point 0: 

^nZ = {0} where g := {h{A) ~ h{B) s.t. A, 5 G N} (4.6) 

For the case where h{A) = uj.A"- for some positive integer a, the condition ()4.6p exactly 
means that the "frequency" u is "non-resonant" in the usual sense. We see that the "res- 
onance" condition is a different notion than the degeneracy property, i.e. the dimension 
of the spectral projectors. 

Let us note that the r.h.s. of (j2.7j) is called the "normal form" of the (perturbed) flow on 
the l.h.s. When H is "resonant", then ()2.7p is called the "resonant normal form". 
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5 - Proof of Theorems 1 & 2 

Actually the theorems ()2.3j) & ()2.7j) are 2 different interpretations of the same formula 
()2.Hj) or ()2.fij) . Let us first prove: 



Proposition 1 

VVTGyi H + F{W) = e-^^^\H + 31W) (5.1) 
Proof: indeed from the definition ()2.H1 and ()1.7j) . F{W) can be rewritten as: 

F{W) - e-^^^^OlW = ^^^^ {H}TW = - ^^^^ {TW}H = e'^^^^H - H 

(5.2) 

where we used the antisymmetry p.2|) . Q.E.D. | 

Proof of (j2.6p : Let us now take the brackets of the 2 sides of ()5.ip : 

{H + F{W)} = {e~^^^HH + 5lW)} (5.3) 

But: 

yV,W eA {e^^^W} = e^^>.{iy}.e-^^^ (5.4) 

Indeed: 

WV, W eA, WneN {{VyW} = J2 (?) ■{V}''-'' .{W}.{-V}'' (5.5) 

The proof of ()5.5|) is an easy recurrence from the case n = 1: cf ()1.3|) . Hence: 

+ F(iy)} = e-^^^^{if + W}.e^^^> (5.6) 
Let us now exponentiate the 2 sides of ()5.6|) (multiplied by any t G M): 

^t{H+Fiw)} ^ exp[t.e-^r^>.{if + W}.e{r^>] = e-{ri^}.e*{^+^^>.e^r^> (5.7) 
where we have used: 

\/A,B e^''-^-^ = A-\e^.A (5.8) 

To finish the proof of ()2.6p . there remains to show that {H} and {CRiy} commute, in the 
algebra of linear operators on A. But from p.3|) . p.lO|) : 

- = {{H}'JIW} = (5.9) 

The formula ()2.6|) is proven. And also the theorems 1 and 2 which are only some rewrit- 
tings of it. Q.E.D. | 
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6 - The 2 Main Applications 

Application 1 - Classical Mechanics: 

Here, an automorphism is called a "canonical transformation". 
First we have to explicit the Poisson bracket: 

V(AA)eJ{xZ^ (6.1) 

i{W}V)iA,A) =i (w'{A,A-A')-A').V{A,A')-(v'{A,A-A')-A').W{A,A') 

where W is the derivative (gradient) of W with respect to A. And we have used a scalar 
product Vr'(A, A — A') ■ A'. If H is "non-resonant", the action of the operator Jl on any 
observable W consists in keeping (in the Fourier coefficients of W) the only term with 
A = 0, i.e. the average over the angles variables 6. So "JIW is a function of A only (i.e. 
also "integrable" ) , and any 2 functions of A only, do commute mutually. Then the flow 
gt{3iw} jg q£ ^j^g same type as the flow e^^^\ which is: 

This is the operator translating the variable 9 by t.uj{A): 

(e*^^>\/)(A, 9) = V{A, 9 + t.uj{A)) (6.3) 
or after a Fourier transformation in 9: 

(e*^^ V) (A, A) = e'*-"(^)-^.\/(A, A) (6.4) 
And similarly for e*^^^^ with uj{A) replaced by {%W)'{A) = {JIW'){A). 
Application 2 - Quantum Mechanics: 

When A has an involution ("*"^ named "adjonction": cf. Remark 1), and when H = H* 
the associated automorphism is called a "unitary transformation". 
The bracket is defined as: 

VA G J{ VA G g{A) (6.5) 
{{W}V){A,A) W{A + A',A-A').V{A,A')-V{A + A',A-A').W{A,A') 

We can put this bracket in a form similar to the Poisson bracket 1)6.111 . by adding and 
substracting 2 terms: 

VA G J{ VA G GiA) (6.6) 
^W{A + A', A - A') - W{A, A-A'Y 



i{W}V)iA,A)= Y 



h 

ViA + A', A - A') - V{A, A - A') 



■V{A,A') 



h 



.W{A, A') + {W{A, A')}V{A, A - A') 
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where the last term is a short notation for: 

. \ W{A, /\').V{A, A -A')- V{A, A - A').W{A, AQ l 

h 

The formula ()6.6|) is reminiscent of the famous "correspondance principle" between the 
classical mechanics and the quantum mechanics: if the set J{ becomes more and more 
dense in M (for instance if it is HZ) with h 0, in such a way that A remains constant, 
then the last term (j6.7|) tends to and the first term in bracket tends to the derivative 
of W with respect to A, multiplied by A'. So that the quantum bracket becomes the 
classical one. Cf. PP for a precise formulation of this fact, in some particular cases. 
From now on, we will choose h = 1, since we will not use this semi-classical limit. 
Here again, if H is "non-resonant", is a diagonal matrix, and so its flow is of the 
same type as the flow of H, since: 

(e*^^>\/)(v4. A) = e''-^HA+A)-hiA)]y^^^ ^g g^ 

and similarly for the other flow. 



7 - Quantitative Estimates 

We start by chosing an arbitrary norm on A. And we replace A by its closure with respect 
to this norm. We deduce a canonical norm for the operator F which acts bilinearly on A: 

|||F|||:= sup ||{ri^}V^|| (7.1) 

v,weA s.t. iiyii=||vKii=i 

We make an important assumption: 
Hypothesis 2 

|||F|||<oo (7.2) 

This hypothesis is necessary to be able to apply the so-called "local bijection theorem" 
to invert the function F. When fl7.2|) is not true, it can be replaced by a weaker one, 
but then we need to use the more complicated theorem of Nash-Moser, cf [12], [H], 0, 
j7] , or the Newton iterative method, as in the KAM theory. They are based on a Frechet 
structure on A i.e. an infinite sequence of norms || . . . \ \s-i > II • • ■ lis instead of only 1 
norm. The hypothesis (|7.2j) is still required but with a weaker norm: 

|||F|||„ := sup sup \\{TV}W\\s . a{s) (7.3) 

sGN V,W€A s.t. ||V|U-i = ||Vl^||s=l 

where a : N — M.^ and lim^^oo '^(■s) = 0. Hence F may be bounded if we admit some 

loss of "regularity". 

Let us also define a norm: 

||P||||:= sup (7.4) 

W£A s.t. \\W\\=1 

Another assumption, is on the operator 01: 
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Hypothesis 3 

11113^1111 <1 (7.5) 
It is fulfiled for many norms, for instance tliose given in (jlU.lj) : cf (jl(J.2j) . 
Then F is invertible when \\V\\ is small enough: 

Theorem 3 - Let V be an element of A and T be defined by or explicitly by 
M.lb]) or M.2t^) . Under the hypothesis 1, 2 and 3: 

If l|V|l < (7.6) 

Then HI < < Hi (7.7) 

35 - \\V\\ - 13 ^ ' 

Proof: Let us start by expanding f{W) as given by ()2.2|) in series: 

m) = f:{-Twr-'^^-w (7.8) 

n=l n -I- i. 

and use the definition (j7.1|) : 

||{riy}r|| < |||r|||.||vr||.||i/|| (7.9) 

and the hypothesis ()7.5|) . so that: 

\\f{W)\\ < EdlirilMliyil)"- ■ IIW^II < (ellirilMWI - i).||l^|| (7.10) 
n=i n + I. 

This proves that f{W) = OiW"^). To compute the derivative of f(W) with respect to W 
we need the following formula, valid for any derivation d G Der(£) of some algebra £: 

de^= /'rft.e*-^.5r.e(i-*)-^ (7.11) 





which may be rewritten as: 

de^=[^^j^dVj.e'' (7.12) 

where the Lie-bracket {. . .} is given by the commutator. Indeed: 

e^.W.e-^ = e^^^W (7.13) 
The proof of ()7.12|1 is obtained by power expanding the exponentials: 

Vn G N 9(V") = J2 ■{V}'"\{dV).V''-'' (7.14) 

which is proven by a simple recurrence. Similarly for (j7.1ip : 

N=0 ■ n,k=0 ^ ^ 

13 



whereas: 



[ dt.e'-^'.dVJ'-'^-'' = E / dt ~ ' V^dV.V'^ (7.16) 
These 2 expressions coincide after using: 

(T.17) 

n!.A;! (n + A; + l)! ^ ^ 

Let us note that ()7.1ip is a generahzation of the following formula (valid when t varies in 
a finite set) to the case where t is a continuous variable: 

9(^y) = T.illVr){dVt)(llVr) (7.18) 

\ t / t \T<t I \T>t I 

When dV commute with V we retrieve the usual formula: 

= {dV).e'^ = e^.idV) (7.19) 

Let us apply ()7.11|) to the Lie algebra £ = L{A) of the endomorphisms of A (the space 
of observables) , for which the bracket is indeed the commutator. And we take for d the 
derivation with respect to W: 

ae^r^> = rft.e*-^r^>.{r . . .}.e(i-*)-^r^> (7.20) 
Jo 

which can be rewritten as: 

where the double bracket is the bracket in L{A) i.e. the commutator: 

yXeL{A) {{TW}}X ■= {TW}.X -X.{TW} (7.22) 



Therefore: 



where: 



F'{w)v = (^^[{{rw}}){rv}^e-^^^^:kw + e-^^^^:R.v + 

{ ^~{TW}^ y^+ (^v{{{rW}}){TV}y-^^^^J^W (7.23) 

^(X) := V{X) := /' ds.^isX) (7.24) 
A Jo 



Let us now use the formula (|7.9p : 

||e^r^>.V|| <ellir|ll-ll^ll.||y|| (7.25) 



so that: 



||9e^r^>.V^|| < rf/t.e*-llirill-ll^ll.|||r|||.e(^-*)-llirill-ll^ll.||\/|| = ellirill-ll^ll.| | |r| 
Jo 



(7.26) 
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And we don't need the formula ()7.23p . Hence f'{W) has a norm bounded by: 

Iim)ll < E^-(llir|IMIw^lirMlir|||- ^''''^''''^^ -111^11 + 



ra=l 



n + 1! 

(llir|||.||^ll)n.!lMll (7.27) 

so that, using ()7.5p : 

||/W||<elll^lll-ll^ll(|||r|||.||iy|| + l)-l (7.28) 
Let us call 7 the solution of the transcendental equation: 

e^(7 + l)-l = l i.e. 7 = 0.3748225258118948 ...> 13/35 (7.29) 

Hence: 

If |||r|||.||Vr|| < 7 Then \\f'{W)\\<l (7.30) 

and so: 

\\F'{W)\\ = 111 + f\W)\\ > 1 - \\f'iW)\\ > (7.31) 
Then F is invertible and: 

\\F{w)\\ = \\w+f{w)\\ < \m.[i + ^^^ 

< ||iy||.(l + e^-l) = e^.||iy|| (7.32) 

Likewise: 

\\F{W)\\ > \\W\\.(l - **||^^|** ) > l|W^II-(2-e^) (7.33) 
So if we replace W by F~^(y) we get: 

llF"^fV")|| 1 
e"^ < — < —— (7.34) 

- Ill^ll - 2-e^ ^ ^ 

under the condition ()7.30|) . i.e. if: 



Indeed we will have in that case: 

l|W^II = l|i^-^(^)ll<^<^ (7.36) 

To prove the theorem 3, there remains to use the value of 7: e"^ = < ||. Q.E.D. | 
Let us note that under the condition ()7.fjj) . the new term (the "control term", fiV)) will 
be smaller than V (cf. dZHDI)): 

If ll^ll<n^ Then Hr^<e'^'-l (< l) (7-37) 
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8 - A Formal Series for the Inverse of F 

We want an expansion of W = F~^{V) in powers of V, using the expansion of F{W) = 
W + f{W) given in ()7.8|) . For that purpose we can use an extension of the Lagrange 
inversion formula which was estabhshed to invert a function from C ^ C in power series. 
An extension to the case where the argument is not a number but a function (hke here), 
was given in ^1], and latter in We first rewrite the definition of ly as a fixed-point 
problem: 

F{W) = V ^ W = G{W) :=¥ - f{W) (8.1) 
A Taylor expansion of G is given by: 

G{W) = Y.G{n)W'' (8.2) 

n>0 

with (cf. EHI): 

G(0) := G'(O) = V and G(l) := (8.3) 

V„, > 2 := ^ = S(-l)-.{r . . .}"-' ■ . . . (8.4) 

is an n-linear completely symmetric application from into A. Its n arguments are sym- 
bolized by the n "slots". In (j8.2p . this n-linear mapping is applied to identical arguments: 
n times W. When applied to n general arguments Wi, . . . , Wn, it would give: 

G{n){W„ . . . , ly.) = H-^nrW,} . . . {riy„_ J ■ + V . (8.5) 

nl 

The operator S is the symmetrization of the arguments which yields, when applied to an 
n-linear application T: 

{§T){W,,...,W^):=^ T(iy.(i),...,iy.(„)) (8.6) 

(tS Permutations 

We will omit the parenthesis and the commas in using such tensors. So W'^ = 
{W, W,...,W) (M times). For instance: 

G'(0)a..):^Um ^'^'^->-^'«> (8.7) 

which is 0, for the function defined in ()8.1|) . since it is quadratic in its argument. And 
G"{0) is a tensor of order 2. 

G-miWu W,) := lim lim GjXW, ^ ,W,) - GiMV,) - Gj.W,) ^ G(0) 

a^om^o A./i 

which can be easily computed to be: 

G{2)Wi.W2 = {TWi}^^W2 + {TW2}'^^Wi (8.9) 
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Theorem 4 - The above mentionned extension of the Lagrange inversion formula says 
that the solution of any fixed-point equation W = G{W), with a function G given by a 
series iS.^) . is formally: 

W'=E E G{u^-i)GiuN-2)---G{u,)G{iyo) (8.10) 

Af>l ue7{N) 

where: 

T(iV) := {u = (z/o, i/i, . . . , z/jv-i) G {0, 1, . . . , iV - 1}^ s.t. 

VA; e {0,l,...,iV- 1} : \u\k<k and \u\n_i = N - 1} (8.11) 

with: 

W\k ■= + 1^1 + ■ ■ ■ + i^k (8.12) 



The expansion ()8.1()|1 is only useful if G{n) are "small", since we expand in powers of 
them. In our case (|8.3p . (|8.4|) . only G(0) is small, and (5(1) = 0. But G{n) is of order 1, 
when n > 2. So we have to rearrange the series ()8.10p : 

W = V+Y.Wm (8.13) 

M>2 

where: 

Wm:= E E Gii^^.,)G{uN-2)...G{i^i)Gii^o) (8.14) 

N=M+1 iye7{N) s.t. j/-i(0)=M 

with the restriction "z/~^(0) = M" in the sum over z/, meaning that u must take exactly M 
times the value 0. Indeed this is exactly the case "z/fc = 0" which produces a factor V. We 
can also notice that in the definition ()8.11|) of T(A^) we can restrict i' G {0, 2, 3, . . . , A^ — 1} 
since in our case G{1) = 0. Before we explicit the first terms of the solution (j8.13|) . 
let us understand heuristically the series (j8.1(J|) or (j8.13|) . A first approximation of the 
solution of W = G{W) is W ^ G{0) := V. Then a second approximation is W ^ 
GiG{0)) ^ G(0) +G"(0).G(0). And a third one, at the order , is W ^ G(G(G(0))) ^ 
G[G(0) + G"(0).G(0)] i.e.: 

W ^ G(0) + G\0).G{0) + G"(0).G"(0).G'(0) + ^.G"'(0).G'(0).G(0) (8.15) 

and so on. We are building the series (j8.1(J|) . Now we have to check that the general term 
of (j8.1(J|) is indeed an element of A and not an arbitrary m-linear mapping for some m. 
Each term is a composition of high-order tensors, and this product is actually a "vector" 
i.e. an element of A. This is due to the definition of 7{N): we note that G{n) is of type 
V.V"' i.e. it is a (sum of) tensorial product(s) of a vector V and of n covectors V. More 
generally we say that a tensor is of type V"^.V"' when it is n times covariant and m times 
contravariant. This is a convenient way to consider such tensors, in order to keep track of 
all subsequent contractions. Of course the ordering is crucial: V.V is a matrix (a tensor of 
order 2) whereas V.V is a scalar (a scalar product). Each time a covector follows (on the 
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left) a vector, we make the contraction, i.e. we reduce the tensorial product to a scalar. 
Actually we just want to give a meaning to the composition of multilinear applications. 
To this purpose we can define: 

V5GL(yi",yi) vcGi:(/i™,yi) s.c g (s.ie) 

by: 

{B.C){Vi, . . . , Vjn, Kn+1; • • • ; Kn+n-l) '■= B(^C{Vi, . . . , Kn), Kra+1, . . . , ^ 

(8.17) 

Now we note that for any u G 7{N) : uq = since |z/|o = 0. So the right-most factor in 
(jOTHl is a vector: G(0) = G ^• 

Then the preceding one is G(z^i) where z/i < 1 since |z/|i = z/q + z/i has to be less than 
1. Hence G{i>i) is either a vector (if ui = 0) or a matrix (if ui = 1). In the latter case 
the product G'(1)G(0) is a vector, and in the former case we have a "bi-vector" which 
will be made into a vector by the higher-order terms: indeed if z/i = then there exists 
a k > 2 such that z/^ > 2 since |z/|jv-i has to be — 1. More precisely we note that 
the product of a tensor V^.V^ by a tensor V^.V'^ is a tensor of type: 

■^a 'Y'b "^c 'Y'd -^a+c— min(b,c) -^b+d— min(fe,c) j^g^ 

So the product G{vk) ■ ■ ■ G^Uq) is a tensor of type: 

G{uk) ■ ■ ■ G{u^) = r^^^-e^-l^l^'^^.r^ (8.19) 

i.e. without any covector, since we assume \v\k < k \/k. Finally for k = N — 1, |i^|Ar_i 
has to be — 1, so ()8.1U|) is indeed a sum of vectors. 

The idea of the proof of (j8.1(jp is to plug the r.h.s. of (|8.1(jp into the expression (j8.2j) of 
G{W) and to recognise that we get the same expansion than (j8.1(jp . i.e. this is also W. 

A simple illustration (which is not of our type) of the theorem 4 is when: 

G{W) := V.e^-^ (8.20) 

for some V ^ A and Y & A* := £(^1, M). In that simple case, the theorem 4 gives the 
solution of the "transcendental" equation in W: 

W = V.e^-^ (8.21) 

as: 

W = V.\{Y ■ V) where A(x) := ^ ^ / (8.22) 

n>0 ^• 

So that A(x) converges if |x| < (but may be extended if a; < — e^^). 
We can rearrange the series ()8.13|) as follows: 

M-l 

Wm := E E E G'(A0 . . . G{X%^^) (8.23) 
A'=i Aes(Af,M) /^ee(Af,A) 

where G is defined in ()8.3|) and ()8.4|) . and: 

S(A^,M) := {A G Nf s.t. |A|^ = M-1} (8.24) 



18 



and: 

G{N, X) := {fi E s.t. yi<n<N < n + | A|„_i} (8.25) 

with: 

l/^ln := /il + • • • + /in (8.26) 

ans similarly for |A|„ with |A|o := 0. Finally in ()8.2Hj) the integer Af is defined for any 
1 < Z < + M by: 

Af = An + 1 if / = \fl\n 

= otherwise (8.27) 

This means that Af = when / is not an element of the set of the values |yu|„, when n varies 
in {1, . . . , A^}. This case occurs M times when / varies in {1, . . . , iV + M}. Otherwise 
when I = \fi\n for some n, then Af = A„ + 1, for this n. 

Finally we can also rearrange the Lagrange series in our specific case ()8.3j) & ()8.4|1 into: 

Wm:='y: E E (-i)^+^^^Hr...r'^^f^^--.-{r...r^^i^T/- 

N=l AeS(Af,M) u(^D{N,\) ~^ 

(8.28) 

where: 

D{N, A) := {u E s.t. = M k 1 < n < N |z/|„ > |A|„} (8.29) 

Let us note that: 



iri 



|n-l 



and so: 

IIw^mII <CM.|||r|||*^-\||\/||^ (8.31) 

for some positive constant cm- Indeed: Ai + ... + A7v = M — 1. 

Let us explicit the first orders of the expansion of the solution W of our problem 1)8.111 - 
(IH3I), as given by ()OT^ or 

F^\V) = W = Y.Wm where Wi = V W2 = {TVj'^^^V (8.32) 

M>1 ^ 

= ({rv}^) V + {Tvf^^v - {rv-}'?^{ry}^v- 
{r(,m^.)}^,r.,^.-{r(,r.,?^±i.))^.- 



19 



and so on for W5. . . We have proven in Theorem 3 that this series converges at least when: 

IIV^II < (8.33) 

The paper [8] proves that a series similar to this one (the Lindstedt series, which is local 
in the variable A) converges. The key ingredient is the compensation between the terms 
of different signs. 

Remark: as we have noticed at the end of section ^ the solution W is independant of the 
constant h since the operator T always appears with a bracket around it. And the pair 
{r . . .} is independant of h . 



9 - Example 1: Classical Control Theory 

We will study a simple model, introduced in [in] and described in of a charged 

particle in a plasma, in a Tokamak, which is a reactor for the controlled thermonuclear 
fusion. We consider a section of the Tokamak as the phase space of a dynamical system 
with 1 degree of freedom. So that the particle has 1 degree of freedom (p, g), but it is 
embedded in a complicated electric field, depending on time. The fast motion given by 
the strong magnetic field has been averaged out. The hamiltonian is: 

H{p,q,E,T) + V{p,q,E,T) where H{p,q,E,T) = E (9.1) 
and V{p, q, E,t) = ^ e^^J^ sin(ng + mp + kr) (9.2) 

The extended canonical coordinates are {E, r) so that the motion of the new dynamical 
variable r is trivial: 

e*-{^+^V = r + t (9.3) 

Here we have applied the flow to the observable r : [p, q, E, r) i-^ r. And of course V 
is independant of the variable E, which is the variable canonically conjugate to r. Let 
us note that in ()9.2j] . the variable k must be integer (or at least away from 0) but the 
variables m, n may be integer or real numbers: in that case the sum over them should be 
replaced by an integral. 

The hamiltonian H is resonant since {H} = 8^- so that: 

{:RV){p,q,T)= <fdT.V{p,q,T) (9.4) 

is independent of r. Indeed a Fourier transformation in r yields: 

{5lV){p, q, E, k) = V{p, g, E, 0). x{k = 0) (9.5) 
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For the perturbation ()9.2|) we have: "JiV = 0. The action of the operator F is defined by: 



{TV){p,q,T) 



■ cos(ng + mp + kr) 

K 



(9.6) 



Then the "control term" fiV) can be exphcitly computed, cf (j7.8|) with JlV = 0: 

fiV) = J2fs where /, := ^ -^V 



s>2 



Hence: 



where: 



with: 



fiy)iP,Q,T)= £n,m,k- sm{nq + mp + kr) 



^n,m,k ■ ^ ^ ( 1 
s>2 



:S—1 n,m,k 



(9.7) 



(9.^ 



(9.9) 



M ._ 

'n,m,k ■ 



E 



K 



N+n,M+m,K+k ^N-n,M-m,K 



(s-l) 



The proof of ()9.1()|1 is based on: 

|cos(A^g + Mp + Kt)^ sin(ng + mp + kr) 
1 



A.{M.n- N.m) (9.10) 



{N.m - M.n) ■ 



sm[{N + n)q + (M + m)p + {K + A;)r] + 



(9.11) 



sin[(A^ - n)q + (M - m)p + {K - A;)r] 
so that we can iterate and compute {FV}* V. 

A simple case where the control term can be computed explicitly is the following. Let us 
choose: 

€ER*, b>l/V2, a = ±l, m,nER s.t. m =^ n (9.12) 
And we take K as a particular case of ()9.2|) . a sum of 2 waves: 



V{p, q, r) = e.a.V2b'^ — 1. sin(ng + mp + t) — e. sin(g + p + r) 



(9.13) 



We have taken m ^ n to avoid that V depends only on a single variable q + p. The role 
of h (and a) is to permit 2 different coupling constants. Then the control term is a sum 
of only 5 waves: 



fiy) = e .a. sin( (n — l)q + (m — l)p) + 



V262 - l.^sin(g + p + r) +sin((2ra- l)g + (2m- l)p + r)^ - 



a. sin(ng + mp + r) + sin((2 — n)q + (2 — m)p + r 



(9.14) 
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where: 



(26= _ 1)./. fl-.o.i.A(,n-n))Y'^ ^ ,,|.|,,„ _ (9,15) 



b \ I'm — n\ J ^4 

(452_2)i/6 / sm( 6. b.(m~n))Y^^ , ,2/, /262 - K i/e 

b \ m-n I ' 'V18/ 

We have indicated the first order of the expansion of e or e when e is smalL So that 
fiy) = o(v^). 

See also jl] , jH] for some numerical experiments that prove the effectiveness of this method, 
when the coefficients en,m,k are taken to reflect some properties of a realistic field, in a 
Tokamak: 

en,m,k = , , ■ X(l < + < iV^). = 1) (9.17) 

for some constant e proportional to the inverse of the (strong) magnetic field and for some 
"cut-off" A^. In that case the first term of the control, £nrnk vanishing when 7^ 0. 
In j3] we also give some quantitative values of the parameters for the rigorous applicability 
of this control. 

Let us summarize the method of control, in this case, where JIV = and {H} = dr'- 

\/teR e*^^+^+^(^)> = e-^r^>.e*^^e{r^> (9.18) 

so that the distance between the dynamical variable p (or q ) at the time t and at the 
initial time is: 

P.-Po=(e*^^+^-^^(^»-l)po (9.19) 
Let us write p instead of po, and use the "telescopic" formula: 

Va,6,c: a.fo.c- 1 = (a - 1) + a.(6- 1) + a.6.(c- 1) (9.20) 
And we replace the flow in (j9.19j) by its decomposition (j9.18j) : 



Pt-P 



(e-{m - 1) + e-^r^>.(e*^- - 1) + e-^r^>.e*^^(e^r^> - 1) 



p (9.21) 



But the middle term vanishes since drP = 0, so that: 

p,-p= (e-^r^> - l)p + e-^^^>.e*^^(e^^^> - l)p (9.22) 
And we can divide and multiply by {Ll^}, and use the antisymmetry (jl.2j) : 

Pt-P= ( ^ry} )wri^-e-^"^>.e^"-.(^^^)Mn^ (9.23) 

Finally, let us note that {p} = —dq, i.e. the formula ()9.23|) can be explicitly computed. 

When we apply an approximate control term (/? instead of the exact one f{V), the formula 
()9.18|) becomes: 

^t{H+v+.fi} ^ ^-{TV} _^-{rw} _^tdr _^mw} _^{rw} _^{rv} ^g ^^^ 
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where: 

W:=F''[e^^''\{if-f{V))'^ (9.25) 

When we take (f = 0, ()9.25p is the begining of the KAM recursive method. See (p^) for 
some extensions and numerical tests of this theory for some dynamical systems. See also 
j7] for more details, in the case of quantum mechanics. 



10 - Example 2: Quantum Adiabatic Transformation 

An example of typical norm on A is given by an arbitrary "weight" function g{A, A) > 0. 
Let us define a norm on the Lie-algebra A by: 

||\/||:=supEinAA)|MAA) (10.1) 

A A 

In the quantum mechanical case, we will take the usual L^-operator norm on each "block" 
V{A,A). This choice is irrelevant in the case where all the projectors P4 are finite- 
dimensional. We could have chosen sup^ ^ but this would have just been (approxi- 
mately) a modification of the "weight" function g. 

Lemma 1 We have: 

||W||<||1/|| & ||W||<||V|| (10.2) 

Proof: The operators "Jl and N are implemented by some characteristic functions: cf 
()1.14|) . ()1.15p . ()1.26|) . ()1.27p . So the sup & sum in ()10.H) are restricted by some conditions: 
therefore the norm decreases. Q.E.D. | 

Hence the hypothesis 3 is fulfiled. Let us consider as before a hamiltonian H but where 
now the perturbation depends on time. We need to extend the algebra and the full 
hamiltonian is now: D + H where D is the "derivative with respect to time" . We can 
apply the proposition 1 1)5.11) but we still use the same operator F i.e. the pseudo-inverse 
of the bracket with H, and also the same function F defined in ()2.1|) . Let us rewrite ()5.ip : 

yWeA H + F{W) = e-^^^\H + :kW) (10.3) 

Now we want some information on D + H + V: 

D + H + F{W) = D + e-^^^\H + W) = e-^^^>(e^^^>D + H + "JIW) (10.4) 

where: 

W := F-\V) (10.5) 

Hence: 

/ e^"^^^ - 1 \ 
D + H + F{W) = e-^^^^ \D + ^^^^ .{TW}D + H + JiW j (10.6) 

so, using ()1.2p . and the notation W := {DjVT: 

D + H + V = e-^^^^[D + Hi + Vi) (10.7) 
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with: 

H,:=H + ^W V,:= ^^^-^^ (10.8) 

Indeed {D]TW = T{D}W = TW. The formula (ITirTj) is useful if the derivative V of the 
perturbation with respect to time is "smaller" than the perturbation V itself. This is the 
"adiabatic hypothesis" . In that case we can also show that W is small, since W ^ V. So 
the new perturbation is "smaller" than the original perturbation, or more qualitatively it 
is approximately: TW i.e. {H}~^{D}V . This means that it has losed some regularity 
in time, but gained some regularity in its "spatial" dependance, since generally {H}~^ is 
a regularizing operator. The difference between H and the new Hi is of the same size as 
the original perturbation, but "diagonalized" with respect to H since it commutes with 
it. Cf also jH] for an iterative proof of this result. 
Let us finally note that it is possible to iterate this procedure: 

D + H + V = e~^^^^e'^^'^'^[D + H2 + V2) where Wi := F^^Vi) (10.9) 

and: 

H2:=Hi + 5liWi V2:= ——^.TiWi (10.10) 

and with !Ni, Fi, Fi defined as before, but with H replaced by Hi. And so on. Gen- 
erally this iteration doesn't converge: so we must stop it at an optimized order. This has 
been done in [lOj, with a different method and framework. 

Finally let us explicit the norm ()7.1|) of the operator F for the norm chosen in pO.lj) : 

ll|r|||< sup (10.11) 

- A,A' s.t%A' \hiA)-hiA')\ ^ ' 

where: 

^(A^O ■■=9A,A'- supmaxf^ , ^-^] (10.12) 

A" \gA',A" gA,A" I 

and gA,Ai = 9 [A, A' — A). The proof is an easy estimation of the norm (jT.lj) . Hence the 
hypothesis 2 ()7.2p requires that: 

^A,A' s.t. Ay^ A' \h{A)-h{A')\>-f.^{A,A') (10.13) 

for some constant 7 > 0. This is a "Diophantine" condition. It depends on the choice of 
the weight function g, i.e. on the regularity of the perturbations V we want to consider. 
For instance, when "K = N*: 

If gA,A' = ^A- 'fA' then i){A, A') = m&x{Lp% Lp\,) (10.14) 

As a particular case we can take, for some a > 1: 

h{A) = A" and = A^ (10.15) 

for which the condition ()10.13p is satisfied. 
See jTj for more details. 
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